Active Grounding of Visual Situations

نویسندگان

  • Max H. Quinn
  • Erik Conser
  • Jordan M. Witte
  • Melanie Mitchell
چکیده

We address a key problem for computer vision: retrieving images that are instances of visual situations. Visual situations are concepts such as “a boxing match”, “a birthday party”, “walking the dog”, “a crowd waiting for a bus,” “a handshake”, or “a game of ping-pong,” whose instantiations in images are linked more by their common spatial and semantic structure than by low-level visual similarity. While computer vision has made remarkable progress on recognizing individual objects in images, the problem of visual situation recognition is much more difficult for many reasons, including the vast variability of possible instances of a given situation, as well as the combinatorics of evaluating possible pairwise or multiple-object relationships. In this paper we describe a novel architecture we have developed for visual situation retrieval. Given a situation description, our architecture—called Situate—learns models capturing the visual features of expected objects as well as probabilistic spatial models capturing the expected spatial configuration of relationships among objects. Given a new image, Situate uses these models in an attempt to ground (i.e., to create a bounding box representing) each expected component of the situation in the image via an active search procedure. Situate uses the resulting grounding to compute a score indicating the degree to which the new image is judged to contain an instance of the situation. Such scores can be used to rank images in a collection as part of a retrieval system. In the preliminary study described here, we demonstrate the promise of this system—and the importance of active grounding—by comparing Situate’s retrieval and grounding performance on one example situation category with that of two baseline methods, as well as with a related image-retrieval system based on “scene graphs”.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic Image Retrieval via Active Grounding of Visual Situations

We describe a novel architecture for semantic image retrieval—in particular, retrieval of instances of visual situations. Visual situations are concepts such as “a boxing match,” “walking the dog,” “a crowd waiting for a bus,” or “a game of pingpong,” whose instantiations in images are linked more by their common spatial and semantic structure than by low-level visual similarity. Given a query ...

متن کامل

Cultural grounding of regret: regret in self and interpersonal contexts.

The purpose of this study was to explore cultural similarities and differences in regret, focusing on distinctions between interpersonal and self-situations, and between action and inaction regrets. Japanese and American undergraduates were asked to describe regrets experienced in interpersonal and self-situations. We found that both situational and cultural contexts influenced the likelihood o...

متن کامل

Back-flashover Investigation of HV Transmission Lines Using Transient Modeling of the Grounding Systems

The article presents the transients analysis of the substation grounding systems and transmission line tower footing resistances which can affect to the back-flashover (BF) or overvoltage across insulator chain in an HV power systems by using EMTP-RV software. The related transient modeling of the grounding systems is based on a transmission line (TL) model with considering the soil ionization....

متن کامل

Electrical and Thermal Analysis of Single Conductor Power Cable Considering the Lead Sheath Effect Based on Finite Element Method

This paper investigates the effect of metallic sheaths on losses and temperature of medium voltage power cables. Two grounding methods of sheaths, including both ends bonding and single point bonding that causes different situations on cable ampacity, are considered. Electrical losses of cables that are main sources of heat are calculated in both conductor and metallic sheath of the cables. She...

متن کامل

Numbers in Space: Differences between Concrete and Abstract Situations

Numbers might be understood by grounding in spatial orientation, where small numbers are represented as low or to the left and large numbers are represented as high or to the right. We presented numbers in concrete (seven shoes in a shoe shop) or abstract (29 - 7) contexts and asked participants to make relative magnitude judgments. Following the judgment a target letter was presented at the to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017